Dopamine signaling predicts individual differences in reinforcement learning in humans
نویسندگان
چکیده
The tradeoff between short-term and long-term consequences is inherent in many important decisions, as retirement savings or health behavior. Different neural accounts have been proposed to underlie intertemporal choice behavior via differences in valuation or selfcontrol mechanisms. Recent studies showed that imagining specific future events reduced reward delay discounting and was associated with increased prefrontal-mediotemporal coupling. We investigated whether temporal lobe epilepsy patients who suffer from hippocampal sclerosis exhibit increased discounting of future rewards, dependent on the side of the sclerosis. Methods: Up to now, 12 patients with right (RMTL), 8 patients with left (LMTL) and 11 patients with bilateral (BMTL) hippocampal sclerosis have been included together with 30 patients with extra temporal lobe epilepsy (ETL). The groups did not differ with respect to age or gender. We elicited time-preferences using three different lists of choices. Subjects had to decide between a sooner and an increasing later amount of money in each row. The three lists differed with respect to the time horizons. Future discounting was measured by taking the switching points from the sooner to the later option. To control for effects of general risk aversion we also conducted an incentivized risk experiment. Subjects had to decide between a lottery with a winning probability of 0.5 and an increasing amount of a safe option. Again the switching point served as a measure for preference. In both time and risk experiments we adapted a similar design as in Dohmen et al. (American Economic Review, 2009). Results: MTL patients with leftor bilateral hippocampal sclerosis showed higher discounting of future rewards compared to right – MTL and ETL patients (mean switching points: LMTL 16.1; BMTL: 18.2; RMTL: 9.7; ETL 10.9, Kurskal-Wallis-Test of independent groups p=0.034). Conclusions: Our results show that patients with bilateral or pure left sided hippocampal sclerosis discount future rewards more steeply, highlighting the importance of the medial temporal lobe in intertemporal choices. Two processes could underlie this effect: first, damage to the hippocampus itself – which is involved in episodic future thinking (Addis et al., Human Neurosci., 2012; Schacter et al., Nat. Neurosci., 2007); second, the inhibition of the neural self-control mechanism through interictal discharges to the left lateral prefrontal cortex (LPFC). Further studies will have to investigate these effects in more detail. Acknowledgements: BW is funded by a Heisenberg-Grant of the German Research Council (WE 4427/3-1)
منابع مشابه
Love to Win or Hate to Lose? Asymmetry of Dopamine D2 Receptor Binding Predicts Sensitivity to Reward versus Punishment
Humans show consistent differences in the extent to which their behavior reflects a bias toward appetitive approach-related behavior or avoidance of aversive stimuli [Elliot, A. J. Approach and avoidance motivation. In A. J. Elliot (Ed.), Handbook of approach and avoidance motivation (pp. 3-14). New York: Psychology Press, 2008]. We examined the hypothesis that in healthy participants this moti...
متن کاملSingle dose of a dopamine agonist impairs reinforcement learning in humans: evidence from event-related potentials and computational modeling of striatal-cortical function.
Animal findings have highlighted the modulatory role of phasic dopamine (DA) signaling in incentive learning, particularly in the acquisition of reward-related behavior. In humans, these processes remain largely unknown. In a recent study, we demonstrated that a single low dose of a D2/D3 agonist (pramipexole)-assumed to activate DA autoreceptors and thus reduce phasic DA bursts-impaired reward...
متن کاملStriatal D1 and D2 signaling differentially predict learning from positive and negative outcomes
The extent to which we learn from positive and negative outcomes of decisions is modulated by the neurotransmitter dopamine. Dopamine neurons burst fire in response to unexpected rewards and pause following negative outcomes. This dual signaling mechanism is hypothesized to drive both approach and avoidance behavior. Here we test a prediction deriving from a computational reinforcement learning...
متن کاملA Model of Dopamine and Uncertainty Using Temporal Difference
Does dopamine code for uncertainty (Fiorillo, Tobler & Schultz, 2003; 2005) or is the sustained activation recorded from dopamine neurons a result of Temporal Difference (TD) backpropagating errors (Niv, Duff & Dayan, 2005)? An answer to this question could result in a better understanding of the nature of dopamine signaling, with implications for cognitive disorders, like Schizophrenia. A comp...
متن کاملCOMT Val(158) Met genotype is associated with reward learning: a replication study and meta-analysis.
Identifying mechanisms through which individual differences in reward learning emerge offers an opportunity to understand both a fundamental form of adaptive responding as well as etiological pathways through which aberrant reward learning may contribute to maladaptive behaviors and psychopathology. One candidate mechanism through which individual differences in reward learning may emerge is va...
متن کاملIndividual Differences in Nucleus Accumbens Dopamine Receptors Predict Development of Addiction-Like Behavior: A Computational Approach
Clinical and experimental observations show individual differences in the development of addiction. Increasing evidence supports the hypothesis that dopamine receptor availability in the nucleus accumbens (NAc) predisposes drug reinforcement. Here, modeling striatal-midbrain dopaminergic circuit, we propose a reinforcement learning model for addiction based on the actor-critic model of striatum...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013